Scale-free networks as preasymptotic regimes 
of superlinear preferential attachment 
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We study the following paradox associated with networks growing according to superlinear prefer- 
ential attachment: superlinear preference cannot produce scale-free networks in the thermodynamic 
limit, but there are superlinearly growing network models that perfectly match the structure of 
some real scale-free networks, such as the Internet. We obtain an analytic solution, supported by 
extensive simulations, for the degree distribution in superlinearly growing networks with arbitrary 
average degree, and confirm that in the true thermodynamic limit, these networks are indeed de- 
generate, i.e., almost all nodes have low degrees. We then show that superlinear growth has vast 
preasymptotic regimes whose depths depend both on the average degree in the network and on how 
superlinear the preference kernel is. We demonstrate that a superlinearly growing network model 
can reproduce, in its preasymptotic regime, the structure of a real network, if the model captures 
some sufficiently strong structural constraints — rich-club connectivity, for example. These findings 
suggest that real scale-free networks of finite size may exist in preasymptotic regimes of network 
evolution processes that lead to degenerate network formations in the thermodynamic limit. 

PACS numbers: 89.75.Fb, 89.75.Hc, 05.65.+b 



I. INTRODUCTION 

Models of complex networks can be roughly split into 
two classes: static and growth models. Static models, 
such as classical random graphs [1[ and their generaliza- 
tions [1, EJEl i generate a whole network at once, trying 
to directly reproduce some properties observed in real 
network snapshots. Growth models, e.g., preferential at- 
tachment [f|, construct networks by adding a node at a 
time, attempting to provide some insight into the laws 
governing network evolution. Compared to static mod- 
els, it is generally more difficult to closely match observed 
network properties with growth models, because in this 
case one usually has less direct control over the properties 
of modeled networks. 

The first growth model that matched the observed In- 
ternet topology surprisingly well, across a wide spectrum 
of network properties, was the positive-feedback prefer- 
ence (PFP) model by Zhou and Mondragon Q. In the 
model, at each time step, one node is added to the net- 
work, and connected to the existing nodes by two or three 
links, choosing different link placement options with dif- 
ferent probabilities. The most important property of the 
model is that the probability to connect a new node to 
the existing nodes of degree k is a superlinear function of 
k. Although there are many other models of the Internet 
evolution, e.g. 0, d, 0, the PFP model stands apart as 
it gives rise to the following unresolved paradox. On the 
one hand the model matches perfectly the observed Inter- 
net, while on the other hand, since it is explicitly based 
on preferential attachment with a superlinear preference 
kernel, it cannot produce, in the thermodynamic limit, 
any scale- free networks jlCf . including the Internet. 

Here we resolve this paradox by showing that superlin- 
ear preferential attachment can have vast preasymptotic 



regimes. Specifically, we first find an analytic asymp- 
totic solution for superlinearly growing networks with 
arbitrary average degree, confirming that the asymptotic 
regime is indeed degenerate — regardless of the average 
degree, only a finite number of nodes have high degrees 
(Section |TTJ) . However, in Section HTT1 we show that if the 
preference kernel is not too superlinear and if the aver- 
age degree is not too low, then this asymptotic regime 
becomes noticeable only at network sizes that are orders 
of magnitudes larger than the size of any real network, 
including the Internet. We thus half-resolve the paradox 
by showing that the PFP model can, in fact, match the 
Internet. Section llVl resoives the other half, by explaining 
why the model does so: its design implicitly reproduces 
the degree correlations in the Internet, which are known 
to define almost all important topological properties, ex- 
cept clustering 0, Q . We conclude in Section [V] with an 
outline of our findings and their implications. 



II. ASYMPTOTIC DEGREE DISTRIBUTION 

In this section we derive the analytic solution for the 
degree distribution of superlinearly growing networks 
(SLGNs) in the thermodynamic limit. We begin by re- 
calling what is known for networks grown by adding a 
single link per node (the average degree k sa 2), and then 
generalize to the case with multiple links. 



A. Single link per node 

The case when a new node attaches to exactly one 
existing target (or host) node is well-studied [lOL [llL fl3| - 
Let the probability that the new node selects a host node 
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of degree k be 



New node 



New node 



N 



(1) 



where the summation is over all N existing nodes and kj 's 
are their degrees. Then the asymptotic degree distribu- 
tion is a stretched exponential for sub-linearly growing 
networks (S < 1) and a power law for linearly growing 
networks (8 — 1). Supcrlinearly growing networks with 
8 > 1 are asymptotically star graphs. 

Specifically, if 8 > 2, then the number of nodes with 
degrees k > 1 remains finite in the thermodynamic limit 
N — > oo, meaning that almost all nodes have degree 1, 
Ni(N) w N. If 3/2 < 8 < 2, then the number N 2 (N) 
of nodes with degree 2 (degree-2 nodes) grows as N 2 ~ 5 , 
while the number of nodes with degrees k > 2 remains 
finite. If 4/3 < <5 < 3/2, then N 3 (N) ~ A 3 " 25 , and the 
number of nodes with degrees k > 3 is finite. In other 
words, there is an infinite series of "phase transitions" at 
critical values 8 P = 1 + 1/p, where p = 1,2,3,..., and the 
degree distribution in SLGNs with 8 lying between these 

i (d = oo), is given by 



critical values, 8 P < 8 < 8. 



N k /N 



' N (k-l)(l-8) ifl<Cfc<Cp; 

1/N otherwise. 



(2) 



In what follows we also consider the extremal growth 
rule, which formally corresponds to the 8 — > oo limit, and 
specifies that a new node attaches to the existing node 
with the maximum degree. If there are several nodes with 
the same maximum degree, then the host node is ran- 
domly selected among them. SLGNs grown according to 
this rule stay stars throughout their evolution, assuming 
they are stars at the beginning. If an SLGN is not ini- 
tially a star, then extremal growth evolves it to almost a 
star, with all new nodes attaching to a maximum-degree 
node in the initial graph. 

Adding one link per node results in growing trees, 
which are not good models of real complex networks that 
all have strong clustering. But even if we are not con- 
cerned with the models' realism, there is another reason 
to consider SLGNs with multiple links added per new 
node. 

While for sub-linearly growing networks, adding more 
than one link should not qualitatively change the degree 
distribution, this modification may have a more promi- 
nent effect on the degree distributions in superlinearly 
growing networks. Indeed, the more links per node we 
add in SLGNs with a finite <5, the stronger the deviations 
from stars we obviously expect to observe. In view of 
the PFP model paradox, one might even start suspect- 
ing that multiple links may resurrect power laws. We 
thus have to exercise more care dealing with multiple-link 
SLGNs. In what follows, we first consider them under the 
extremal growth rule, and then remove this restriction. 




(a)Two hosts (b)One host 

FIG. 1: Link placement options for two links. 

B. Multiple links per node. Extremal growth. 

We denote by m the number of links added per new 
node. The PFP model uses a superposition of the m = 2 
and m — 3 cases, and a combination of the following two 
link placement options: place a link either between the 
new and host nodes, or between the host and another 
existing node, called the peer node. Links are always 
placed such that the subgraph induced by the new links 
is connected and contains the new node, so that the net- 
work stays connected at each time step. For concreteness, 
we shall assume that m is a fixed positive integer, and 
consider cases with different m separately. Another im- 
portant restriction is that we construct simple graphs, 
i.e., self- loops and multiple links between the same two 
nodes are not allowed. 

We first focus on the case with m = 2. In this case we 
have only two options to place two links (see Fig. [TJ: 
place both links between the new and host nodes, or 
place one link between the new and host nodes, and 
place another link between the host and peer nodes. 
The both options, or any their superposition, produce 
the same result. Let the initial network be two discon- 
nected nodes. Adding the third node according to the 
extremal growth rule creates the star graph with three 
nodes. We shall represent our graphs by their degree 
sequences (ki, . . . , k^). The degree sequence represen- 
tation turns out to define, up to an isomorphism, the 
graphs grown according to our extremal growth rule. The 
star graph after the first step is (2, 1, 1) in this represen- 
tation. Applying the extremal growth rule to add the 
fourth node, we obtain (3, 2, 2, 1), and then (4, 3, 2, 2, 1), 
(5,4,2,2,2, 1), and generally 



(N —l,N ■ 



(3) 



We prove ([3]) by induction. We already checked its 
validity for small N. Assuming that ([3]) holds for some 
N > 4, we establish it for N+l. If we plac e the two links 
according to the first option, shown in Fig. 1(a) then the 



new node attaches to the nodes with degrees N — 1 and 
N - 2. Thus degree N - 1 ^ N and N - 2 t-^ N - 1, 
the new node acquires degree 2, other degrees do not 
change, and the new graph has indeed the same structure 
@. If we use the other link placement option shown in 
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New node 



New node 



FIG. 2: Open 2-book (8,7,2,2,2,2,2,2,1) 



Fig. l(b)| we must choose the node of degree N — 2 as 
the host node. We cannot attach the new node to the 
node of degree N — 1 because this latter node is already 
connected to all other nodes, and therefore we cannot 
add the second link between this hub node and any peer 
node. Selecting the node of degree N — 2 as the host, 
we notice that it is connected to all other nodes, except 
the degree- 1 node. Therefore this latter node is the only 
choice for the peer node. Hence N — 2 i— > N and 1 i— ► 2, 
the new node acquires degree 1 , and other degrees do not 
change. Thus the new graph has the same structure ([3]). 

We shall call the graphs series j3|) the open 2-books. 
The justification for this name is as follows. The link 
between the two nodes of highest degrees, denoted by A 
and B, is the binding of an open book. Each degree-2 
node is connected to A and B, and the resulting triangle 
is a page. Thus, an open 2-book contains N — 3 triangular 
pages. Finally, the link between the highest-degree node 
A and the dangling degree- 1 node is a built-in bookmark. 
The open 2-book graph with N = 9 nodes is shown in 
Fig.H 

Note that we can call a star an open 1-book. It does 
not have bookmarks, its binding is the hub node, and its 
N — 1 pages are all the links. 

We now move to the case with three links added per 
new node, m = 3. Generalizing the link placement op- 
tions for two links, there are four options for placing 
three links, shown in Fig. [3] Choosing the first option 
with three host nodes (Fig. 3(a)), the application of the 



extremal growth rule to the initial graph (0, 0, 0) yields 
the graph series (3,1,1,1), (4,3,2,2,1), (5,4,3,3,2,1), 
(6, 5, 4, 3, 3, 2, 1), (7, 6, 5, 3, 3, 3, 2, 1), and generally 



(N- 1, N -2, N -3, 3,..., 3, 2,1) 



(4) 



JV-5 



Using the same logic as in the to = 2 case, one can prove 
that the extremal growt h ind eed produces ((4|. The link 
placement option in Fig. 3(c) results in exactly the same 
graph series. Placing links as in Fig. |3(d)[ we obtain 
almost the same graph series, except that the first graph 
is (2, 2, 1, 1). The option in Fig. 3(b)| leads to a different 
graph series, but almost all nodes still have degree 3. 




(a)Three hosts 
New node 



(b)Two hosts 
New node 




(c)One host 



(d)One host 



FIG. 3: Link placement options for three links. 



We call the graph series ([4]) open 3-books. The binding 
is the triangle ABC connecting the three nodes A, B, 
and C of highest degrees N — 1, N — 2, and TV — 3. 
Each page is a tetrahedron ABCD, where D is one of 
the TV — 5 degree-3 nodes. Thus, an open 3-book has 
N — 5 tetrahedral pages. It also has two bookmarks: 
triangle ABE and link AF, where E and F are the nodes 
of degrees 2 and 1 . 

Generalizing to an arbitrary to, we notice that there 
are combinatorially many possibilities to place to links. 
In general, they lead to different graph series. For con- 
creteness, in the rest of this paper we focus on the sim- 
plest option with no peer nodes and to hosts, i.e., the 
generalization of Figs. l(a)|3(a) In this case, if N is suf- 
ficiently large, i.e., N > 2m, then the resulting graphs 



(N — 1, . . . , N — m, to, . . . , to , to 

JV-2m+l 



1,- 



(5) 



These graphs are open m-books. If we imagine them 
placed in an m + 1-dimcnsional ambient space, then they 
contain: 

• one 2-codimensional binding, i.e., the to— 1-simplex 
A\ . . . A m composed of the to highest-degree nodes; 

• N — 2to+1 1-codimensional pages, i.e., TO-simplices 
A\ . . . A m D, where D is one of the N — 2m + 1 
degree-TO nodes; 

• to — 1 bookmarks of codimensions 2, 3, . . . , to, i.e., 
to — 1-, to — 2-, ... , and 1-simplices (one simplex of 
each dimension) , composed of links interconnecting 
highest-degree nodes and nodes of degrees k < to. 

The notion of an open book appears in mathemat- 
ics [l3j], where it finds various applications, e.g., as a 
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tool to establish connections between contact geome- 
try and topology. In its simplest definition, an open 
book is a fibration of a manifold by a collection of 1- 
codimcnsional submanifolds (pages), joined along a 2- 
codimensional submanifold (binding). Open books with 
bookmarks (formal definitions are obvious) seem natural, 
and perhaps they will find applications, too. 



C. Removing the extremal growth restriction 

In this section we outline the logic behind removing the 
extremal growth condition. (A more detailed exposition 
is presented in the Appendix.) We assume that 8 has a 
finite value. First, we estimate the probability that an 
SLGN remains an open book. We then characterize the 
deviations from the open book structure. For clarity, we 
consider the simplest case with m — 2 and the first link 
placement option in Fig. 



1(a) 



Consider a network of large size j, so that j ~ j — 
1 ~ j — 2 w j — 3, and suppose that it is an open 2- 
book ([3]) . Avoiding multiple links between the same pair 
of nodes, the probability [P^wj+i that after adding one 
node the network preserves its open book structure is 
approximately 



3 s +3* 



3 



f + j 5 + j ■ 2 s j s +j 



■2 s 



(0) 



Indeed, the first factor is the probability that one of the 
two nodes of degree w j is selected as the first host, 
while the second factor is the probability that the other 
such node is selected as the second host, in which case 
the network preserves its approximate open book struc- 
ture. Using ^ we estimate the probability Tat that upon 
reaching size A the network is still an open 2-book 



N 



N 

n 



5-1 



1 



(f) 



,5-1 



finite in the limit A 



2-8 



if 8 > 2; 

if S = 2; (7) 

if 8 < 2. 



We thus see that if 8 is sufficiently large, viz. 8 > 2, then 
there is a finite probability that the network preserves its 
open 2-book structure throughout the entire evolution. 
This observation implies that even if it is not an open 
book, the distortion of the open book structure is finite, 
e.g., a finite number of nodes have degree k > 2, degrees 
of nodes A and B in Fig. [2] are respectively lower, etc. 

However, if 8 ^ 2, the network is not an open book 
with high probability. But even though the exact open 
book structure is almost surely destroyed, the distor- 
tion is still asymptotically small and admits analytic es- 
timates. Indeed, let us first estimate the number N 3 (N) 
of degree-3 nodes in an A-sized SLGN with m = 2 and 
8^2. This number grows if instead of connecting to the 



highest-degree node with probability JV^jv+i in Eq. ([6]), 
the new node selects the other option and connects to a 
degree-2 node with probability 1 — Tn^n+i- Therefore 



dN 3 
dN 



1-T 



5-1 



(8) 



where we have neglected loss terms describing the de- 
crease of the number of degree-3 nodes due to new nodes 
connecting to them and changing their degrees to 4 or 5. 
These loss terms, as well as corrections to the approxi- 
mate expression for Vn^n+i in ©> are sub-leading, as 
we show in Appendix. The integration of Eq. ([5]) gives 



AWAr , 61nA if 8 = 2: 
^HaA" if*<2, 



(9) 



which we juxtapose against simulations in Section IIIII 
We thus see that the number of degree-3 nodes grows 
sublinearly with A, and consequently their proportion in 
the thermodynamic limit is infinitesimal. We also note 
that the solution in Eq. © allows us to compactly rewrite 
Eq. (0) as 

0V~ e -W). (10) 
The obvious generalization of ([8]) for higher degrees is 



dN k N k ^ 



dN 



(11) 



Solving recursively yields the connectivity transitions 
quite similar to those in the m = 1 case ([2]) 



N k /N. 



N ( k -2)(1-S) 

1/A 



if 2 k ^ p 
otherwise, 



1: 



(12) 



for any 5 such that 8 p < 8 < <5 p -i, where S p = 1 + 1/ p and 
p = 1,2, 3,.... The only difference between the degree 
distributions for the m = 1 and m = 2 cases (Eqs. © 
and (fl"2")) ) is that the latter is the former shifted along 
the &-axis to the right by 1 (Eq. (12]) is Eq. j2]) with 
fe i-> k- 1). 

Therefore, the same infinite series of connectivity tran- 
sitions appear for any m 1, and the asymptotic degree 
distribution is given by 



N H /N. 



N (k-m)(l-6) 

1/A 



if m ^ fc ^ p 
otherwise. 



i; 



(13) 



III. PREASYMPTOTIC REGIME 

We have shown that all SLGNs are asymptotically 
open books, while Zhou and Mondragon 0] showed that 
a specific SLGN of a finite size exhibited clean power 
laws. Another apparent disagreement is that according 
to our analysis, N k /N — > for all k > m, while the PFP 
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model simulations show that Nk ~ N for all k. The 
explanation of these paradoxes lies in the fact that the 
PFP model has a vast preasymptotic regime, and both 
the Internet size and sizes achievable in simulations lie 
deep within this regime. In this section, we describe two 
main factors that render this regime extremely vast for 
the PFP-modeled Internet. 

The first factor is that 8 in the PFP model exceeds 1 
only slightly (specifically, 8 « 1.15 in [6|]). For clarity, let 
us focus on the following concrete example. The propor- 
tion of degree-3 nodes N 3 /N in the m = 2 case scales as 
A^ 1- *, so that if 8 is close enough to 1, then the deviation 
of N 3 (N) from the linear growth may be hard to observe 
for insufficiently large N. Indeed, N — 10 4 (the order 
of the Internet size) and 8 = 1.15 substituted in Eq. © 
yield N 3 /N « 0.98, contradicting the assumption made 
to derive Eq. © that the network is almost an open 2- 
book and hence N 3 /N <C 1. This contradiction means 
that we are very far from the asymptotic regime. Even 
if we choose N = 10 10 (almost two autonomous systems 
per person!), the ratio N 3 /N goes down only to 12%, so 
it is still far from negligible. To get it down to 1%, we 
would need N = 10 17 , non- achievable in simulations. 

The second factor deepening the preasymptotic regime 
is m > 1. The larger m, the slower the decay of Nk/N ~ 
^y(fc— m)(i— *) f or > TOj t ne deeper the preasymptotic 
regime. For example, using the results from [ll| for 
N 3 (N) in the m = 1 case, we find that the Internet 
size N = 10 4 and 8 = 1.15 yield N 3 /N « 0.24, and to 
get it down to 1%, we would need only N = 10 8 , while 
N = 10 10 makes it 0.4% — all the numbers are substan- 
tially lower than in the m = 2 case. 

We juxtapose these analytic estimates with simulations 
in Figs.0]and[5l showing the proportion of degree-3 nodes 
N 3 /N and the overall degree distribution Ylk'>k Nk/N in 
SLGNs of different size N, grown with different 8 and m. 
For each combination of (N, 8, m) we average the results 
over a number of graph instances ranging from 3 for the 
largest size N = 10 5 to 100 for smaller N, We select the 
values of 8 = (S p + 8 p -i)/2, 8 P = 1 + 1/p, p = 1, . ..,7 
(<J = 3 for p = 1), so that the selected 8- values lie within 
the connectivity transition intervals discussed above. For 
p = 7, 8 = 1.15, i.e., the 5-value used in [f|. 

In Fig. U] the cases with m = 1 and m = 2 confirm the 
expected: the larger 8, the more quickly the proportion 
N 3 /N approaches our analytic prediction of its asymp- 
totic scaling. Comparing m = 1 and m = 2, we see that 
in the former case, only for 8 = 1.15 does N 3 /N stay con- 
stant for all graph sizes N achieved in our simulations, 
while in the latter case (m = 2), this ratio is constant 
for higher 8- values (8 = 1.23) as well. We see that for 
small <5's, the scalings of N3/N are much farther from 
their asymptotes in the m = 2 case than in the m = 1 
case. The m = 3 plot confirms that N 3 /N quickly satu- 
rates to a dependent constant that increases with 8, while 
for m = 6, N s /N decays as expected, ~ l/N, with the 
stronger fluctuations, the smaller 8. 

Two factors contribute to the discrepancies between 
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FIG. 4: (Color online) Scaling of the proportion of degree- 
3 nodes N3/N in SLGNs with different S and m. The solid 
lines are the analytic predictions for the leading term from 
[IH for m — 1 and Eq. <(9j for m — 2. The dashed lines are 
simulations. The dotted line is l/N. 



the analytic predictions and simulations in Fig. [4] First, 
we neglected loss terms in Eq. ([9]) . Taking those into ac- 
count would yield, for m = 2, the asymptotic expansion 



N 3 (N) =aN 



2-5 



bN 



3-25 



cN 



4-38 



(14) 



where b and c are some constants that depend on 8. For 
8 = 1.15 this expansion turns into 

N 3 (N) =aN - 85 -bN - 7 + cN°- 55 + ... (15) 

explaining why keeping only the leading term in the 
asymptotic result may lead to huge errors for small <5 
and N. 

The second discrepancy factor is that all the Nk (N) an- 
alytic estimates above are actually the average values of 
the corresponding random quantities. Nothing is known 
about fluctuations of the degree distribution, the analy- 
sis of which is difficult even in the simpler case of linear 
preferential attachment [14j . 

Fig. [5] provides a more global view of the dependency 
of the degree distribution on 8 and m. The higher 8, 
the more skewed the degree distribution and hence the 
more star-like the graphs. For N — 10 5 , 8 — 3, and 
m = 1, all the graph instances in our simulations are 
stars. The larger m, the closer the degree distribution 
curves corresponding to different N are to each other 
(neglecting the size-dependent cut-offs exhibited by all 
graphs), the straighter these lines, and thus the weaker 
the dependency of the degree distribution shape on the 
network size, and the deeper the preasymptotic regime. 
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FIG. 5: (Color online) Scaling of the degree distributions in SLGNs with different 8 and m. The lines show the complementary 
cumulative distribution of node degrees (^2 k i >k Ny /N) measured in simulations. 



IV. RICH CLUB CONNECTIVITY VERSUS 
JOINT DEGREE DISTRIBUTION 



We have shown in the previous section that the power 
laws empirically observed in the PFP model do not con- 
tradict the asymptotic open book structure of SLGNs, 
since typical network sizes considered in simulations are 
preasymptotically small. However, this argument does 
not explain why the PFP model almost exactly repro- 
duces not only the power-law degree distribution ob- 
served in the real Internet, but also a long list of other 
important network properties. Since the preasymptotic 
regime is not amenable to straightforward analytic treat- 
ment, in this section we approach the problem from a 



different angle, and provide a simple explanation based 
mostly on previous empirical work. 

We first notice that the fact that the PFP model ex- 
hibits preasymptotic power-law behavior is not so much 
surprising, because for 6=1 the model produces asymp- 
totic power laws, and this asymptote is quickly achieved 
for small N. The results of the previous section indicate 
that if 8 g2 1 and m > 1, then this power-law asymptotic 
behavior unnoticeably changes to preasymptotic, slowly 
transforming into the new asymptotic behavior only for 
very large N. 

Yet this argument does not explain why the PFP model 
reproduces so many other network properties observed 
in the Internet. Previous work & II EE EE E3 shows 
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that the degree distribution alone does not fully de- 
fine all other Internet's properties, i.e., the Internet is 
not llf-random in the terminology of [3], but is almost 
2i4f-random — its structure is very close to the struc- 
ture of maximally random graphs constrained by its 2- 
point degree correlations, or the joint degree distribution 
(JDD) defined by the total number Nkk' of links between 
degree-fc and degree-fc' nodes. In other words, the Inter- 
net's JDD narrowly defines almost all its other important 
properties, except clustering H, El- 

Although the PFP model is not concerned with the 
JDD per se, it reproduces precisely the observed rich club 
connectivity (RCC) <p(r/N) defined as the ratio of the 
number of links in the subgraph induced by the r high- 
est degree nodes to the maximal number of such links 
u). The values of ip(r/N) observed in the Internet for 
small r are substantially higher than in networks grown 
according to linear preferential attachment. Superlin- 
ear preference increases the connectivity density among 
high-degree nodes, which explains why the PFP model 
successfully captures the observed RCC. 

In the rest of this section we analyze the relationship 
between the JDD and RCC. Specifically, the JDD almost 
fully defines RCC: any two graphs with the same JDD 
have almost the same RCC. While the converse is gen- 
erally not true, a given form of RCC introduces certain 
constraints to the JDD. Given the JDD's definitive role 
for the Internet topology, we conclude that reproducing 
Internet's RCC must significantly improve the accuracy 
in capturing all other properties of the Internet topology 
that depend on degree correlations, which explains the 
success of the PFP model and provides clear grounds for 
the discussion in [l8|, . 

To see that the JDD almost fully defines RCC is 
straightforward [2(J. We first get rid of the node rank 
r in (p(r/N). The rank of a node is its position in the 
degree sequence sorted in decreasing order, i.e., as in ([3]). 
Recall that the node rank is essentially the complemen- 
tary cumulative distribution function for node degrees: if 
di and r, are the degree and rank of node i, fc max is the 
maximum degree, and if we denote N£ — Xfc^fc ^fc' > 
then 1 + N+ +1 < r z < JV+. Thus, the JDD and RCC 
are directly related via ip k defined as the total number 
of links between degree-fc nodes and nodes i of higher 
degrees di ^ fc 

^ = (^)^ + /^)-(^2 +1 )^ + + i/^) 

fcmax 

= J2N kk ,. (16) 

fc'=fc 

It follows that the JDD defines RCC, up to reordering of 
nodes of the same degree. 

To illustrate how the RCC constrains JDD, we choose 
to consider a common projection of the JDD, the av- 
erage degree of the nearest neighbors of degree-fc nodes 
fcnn(fc)- We first look at the maximum and minimum 



possible value of k nn (k) for a class of graphs with some 
fixed degree distribution with minimum and maximum 
degrees of 1 and fc max - We then suppose that (p k is also 
given as a constraint, and we quantify how this constraint 
narrows down the spectrum of possible values of k nn {k). 

It is easy to see that the minimum and maximum val- 
ues of k nn (k) without the ifk constraints are simply 1 and 
fcmax, if we neglect any structural constraints that a given 
form of the degree distribution imposes on possible JDDs. 
For example, if Ni > k max N kmXL , then fc ran (l) cannot be 
fcmax, it is necessarily less than fc max . Scale- free networks 
with 7 < 3 have these constraints for links connecting 
nodes of degrees fc and k' such that fcfc' > kN [2lJ. To 
formally see that without such constraints the minimum 
and maximum of k nn (k) is 1 and fc m ax, let /ikk' = l + ^fcfc' 
be the factor taking care of links between nodes of the 
same degree in Mkk' = Hkk' Nkk' , so that the total num- 
ber M k of "edge ends" (stubs) attached to degree-fc nodes 
is Mk = kNk = Ylk' Mkk' ■ We then have, by definition, 

k nn {k) = —Y,k' M kk ' ■ (17) 
k fc' 

(The more common definition for the normalized dis- 
tributions P(k) = N k /N and P(fc,fc') = M_ kk '/(kN) 
such that £fcP(fc) = E fc fc' P (M') = 1 is k nn (k) = 
T lk ' k ' p ( k '\ k ) = k /( kP ( k ))J2 k ' k'P{k',k).) The mini- 
mum (maximum) values of k nn (fc) are achieved when all 
degree-fc nodes are attached only to the nodes with the 
minimum (maximum) degrees, 



k n™{ k ) — 




where the minimum (maximum) is taken over all possible 
JDD matrices M kk ' yielding the given degree distribution 
M k . We thus see that the maximum difference between 
possible values of k nn (k) is 

A(fc) = CT(fc) - CT(fc) = ^nax ~ 1. (18) 

In the Internet the maximum node degree is large (it 

scales as fc ma x ~ HH), and hence A(fc) w fc max . 

Suppose now that 4> k = X^fc ^fcfc' = fk + N k k is 
given as a constraint. Note that cj>k is not precisely equal 
to ifk, but we neglect this extra Nkk term here as well, 
partly because in the Internet, Nkk is relatively small 
for almost all fc. Introducing ratio ctk — 4>k/Mk, which is 
approximately the ratio of the number of links connecting 
degree-fc nodes and nodes of higher degrees to the number 
of all links attached to degree-fc nodes, we write the new 
minimum value of k nn (fc) as 
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'fe-i 



k-l 



Kn( k Wk) = -^r- \ min ( Y, k ' M kk' I Mkk ' =M k -<t>k)+ min ( k ' M ™ I £ Mfcfc ' 



M fe 
1 



\fe'=i 



fe'=i 



{1 ■ (M fc - fc ) + fc • fe } = (fc - l)a fc + 1, 



\k'=k 



k'=k 



(19) 



where the minimum is now taken over all JDDs Mkk 1 sat- 
isfying the RCC constraints. Similarly, for the maximum 
possible value, we have 



\k\a k ) 



1 

(fcmax — k + l)a k + k - 1 



{(fc - 1) ■ (Affc - 0fc) + ^max ' <M 

(20) 



and the maximum possible difference is 



A(k\a k ) 



\k\a k ) 



\k\a k ) 



2k + 2)a k + k - 2. 



(21) 



Compared to the unconstrained case, the relative de- 
crease of the range of possible values of k nn (k), assuming 
large fc max , is 



A(fc)- A(fcK) 
A(k) 



1 



2 

; 

k fen 



^max 
fe 



1 



if a k ps 0; 
if a fe pa i; 
if a fc ps 1. 



(22) 



In disassortative networks, such as the Internet (l7j , most 
links incident to medium- and high-degree nodes lead 
to low-degree nodes, meaning that a k ps except for 
&Amax "C 1. Given (|2"2")) . we conclude that the RCC 
introduces significant constraints to the JDD, reflected 
even in a JDD's simple summary statistic k nn {k), except 
for lowest degrees k ps 0, and perhaps highest degrees 
k ps fe max , for which our analysis may be not very accu- 
rate since we neglected the structural constraints that are 
relevant in the high-degree zone. We confirm this con- 
clusion in Fig. [5] where we use the RCC in the measured 
Internet topology to compute the RCC-induced relative 
decrease 1 — A(k\a k ) / A(k) of the range of possible val- 
ues of k nn (k). In the medium-degree zone this decrease 
reaches 80%. 



V. CONCLUSION 

Preferential attachment is a robust mechanism that 
may be responsible for the emergence of the power-law 
degree distributions in some complex networks § . How- 
ever, power laws emerge only if the preference kernel is 
a linear function of node degree [l(| [HI ■ If one believes 
that preferential attachment is a driving force, explicit or 




FIG. 6: Relative decrease of the range of possible values of 
k nn (k) imposed by the Internet's RCC. The Internet map 
from [13] is used to compute A(fc) and A(k\a k ) given by 

Eqs. ipfcry 



implicit, behind the evolution of complex networks, then 
the natural question one has to face is why this kernel 
must be exactly linear in so many so different complex 
systems. 

In this paper we argue that even if the preference kernel 
is not linear but slightly superlinear, preferential attach- 
ment may still produce scale-free networks, except that 
it does so not in the asymptotic but in a vast preasymp- 
totic regime. Two key factors contribute to the depth of 
this regime: 1) how close the preference kernel is to being 
linear, and 2) how many links are added per new node. 
These factors allow us to say, informally, that multiple 
links added under slightly superlinear preferential attach- 
ment resurrect power laws, although only by means of 
deepening the preasymptotic regime. 

The asymptotic regime is still degenerate: adding m 
links leads to the asymptotic degree distribution P(k) — > 
<5fc jm . More precisely, the asymptotic network structure 
is a distorted (or "torn") open m-book — a generaliza- 
tion of the known object in topology [l3l ]. The level of 
distortion depends on how close the preference kernel is 
to a linear function. Similar to the m = 1 case (the 
open 1-book is a star), we find an infinite series of con- 
nectivity transitions characterizing the degree of damage 
to the open book structure, as the kernel approaches a 
linear function. 

To explain the success of one particular superlinear 
model — the positive-feedback preference model 0] — in 
capturing not only the degree distribution but also many 
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other important properties of one particular complex net- 
work, the Internet, we analyze the both- way relationship 
between the joint degree distribution (2-point degree cor- 
relations) and rich-club connectivity. The former defines 
the latter, while the latter constrains the former. These 
constraints, captured by the model, suffice to reproduce 
many other important Internet's properties, since it has 
been shown that most of them, except clustering, depend 
only on the joint degree distribution [1, 0|- 

Given that the depth of the preasymptotic regime in- 
creases with the number m of links added per node, and 
that the average degrees k w 2m of some complex net- 
works including the Internet have been reported to grow 
with network size [H, [23|, 0|j our findings, taken alto- 
gether, imply that some complex networks may exist in 
vast preasymptotic regimes of evolution processes that 
have degenerate network formations as their asymptotes. 
We contrast this implication with the observation that 
the vast majority of the existing network evolution mod- 
els are designed with the goal to yield asymptotic power- 
law distributions, quickly achievable at small network 
sizes. 

An interesting open question is whether the dynamics 
of the world economy supports our findings. Specifically, 
does the superlinear growth of wealth contribute to 
such effects as the "shrinking middle class" [25|, [26| and 
growing wealth inequality [27|, [28|? More succinctly, is 
the Pareto distribution preasymptotic [23l. [29l|? 
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APPENDIX: NONEXTREMAL GROWTH 

The network remains an open book throughout its evo- 
lution with probability 



3=2 



(A.l) 



where Tj^j+i is the probability of attaching the new 
node to the two nodes of highest degree 

(J l) s + (J 2)' + (j - 3) • 2* + 1 Qj 
and we used the shorthand notation 



Qi = 
+ 



i 



(j - 2)* + (j - 3) • 2*' + 1 
1 

(j - 1? + (j - 3) ■ 2* + 1 



When 5 > 2, the probability to remain an open book 
is finite, although it vanishes very rapidly when S ap- 
proaches to 2 from above: 



T 



exp 



(» 



5-2 



(A.2) 



When (5^2, the exact open book structure will be 
certainly destroyed at some moment. A sufficiently large 
network is thus not an open book exactly, yet the devi- 
ation from this structure is rather small. Consider for 
concreteness the range 3/2 < 5 < 2 where the number of 
degree-3 nodes keeps growing while the number of nodes 
of degrees ^ 4 remains finite. The degree sequence reads 



(^2,3^^3, 2,.. .,2) 

N 3 N-N 3 



(A.3) 



where k\ and k 2 are the highest degrees, and where we 
have not displayed a finite number of other nodes whose 
degrees are different from 2 and 3. The two highest de- 
grees ki and k 2 are slightly smaller than N. To deter- 
mine k\ and k 2 we first recall that the sum of all degrees 
is twice the total number of links, 



N 
3=1 



2L 



(A.4) 



Since L = 2N — 4 when m — 2 and the initial sequence 
is (2, 1, 1), we use (|A.3|) and re- write (|A.4[) as 

^ + k 2 + 3N 3 + 2(N - N 3 ) = 4N + 0(1) (A.5) 

from which ki + k 2 = 2N — N 3 + 0(1). Combining this 
relation with inequalities ki < N and k 2 < N, we obtain 



h=N-pN 3 , k 2 = N-(l-p)N 3 



(A.6) 



We now argue that p = 1/2. Indeed, in the leading or- 
der the difference k\ — k 2 evolves according to the rate 
equation 



dN 



kt 



4 N ■ 



2 ■ N s 
S ■ 2 5 - 1 



N 5 

(h - k 2 ) 



(A.7) 



This suggests that k\ — k 2 remains finite and therefore 
supports (|A.6[) with p = 1/2. While the latter assertion 
is correct, equation (|A. 7|) just shows that bias in favor of 
the node of the highest degree k\ over the second high- 
est degree k 2 is too small. However, there remain pure 
stochastic fluctuations, and the difference k\ —k 2 is there- 
fore a random variable of the order of \/N 3 . Thus 



ki = N-±N 3 , 



h-k 2 = 0(y/N 3 ) 



(A. 



Let us now compute A^. In the leading order we have 
dN 3 



dN 



(A.9) 
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where for T/vwjv+i we should ignore 0(N 3 ) corrections, 
N 5 + N 5 N s 



N s + N s + N ■ 2 s N s + N ■ 2 s 



(A.10) 



Plugging (jA.lOp into (|A.9|) and keeping only the leading 
contribution we get 



dN 3 _ / 2 
dN \N 



8-1 



(A.ll) 



which leads to N 3 = sJV 24 ' from ©. 

To extract the sub-leading term, both (|A.9p and (|A.10[) 
should be modified. To modify Vn^n+i we use (|A.3|) 
and (|A.8[) and get a more accurate formula for 

rp = 2-(N-N 3 /2) s 

N " N+1 2 • (N - N 3 /2f + (N — N 3 )2 S + N 3 -3 5 

(N - N 3 /2) s 

(N - N 3 /2) s + (N - N 3 ) -2 S + N 3 -3 S 

The modification of (|A.9p is 



dN 3 N 3 -3 d 
— _l-3W +1 -3- 



(A.12) 



where the last term on the right-hand side assures that 
whenever the new node links to a node of degree 3, we 
have a loss rather than gain. After lengthy calculations 
one gets 

N 3 (N)~i aN2 ; S s + °W 2S if3 / 2 </< 2 (A.13) 
dV ' \a N 2 - s - bN 3 - 2S if S < 3/2 V ; 

Strictly speaking, in writing TVi->./v+i we assumed that 
5 > 3/2. However, a more detailed analysis shows that 
the nodes of degree 4 do not influence the sub-leading 
correction bN 3 ~ 2S . 



When 4/3 < 5 < 3/2, the nodes of degree 4 become 
visible, and the network degree sequence becomes 

(fci,fc2,4,...,4,3,...,3, 2,...,2) (A.14) 

N 4 N 3 N-N3-N4 

A straightforward generalization of our previous argu- 
ment gives 

fci = N - i iV 3 - JV 4 , fci - k 2 = O(V^s) (A-15) 

In the leading order, the quantity N 4 evolves according 
to 



dN 4 N 3 -3 5 N 3 - 3 s 



dN 2 ■ N s N s 



(A.16) 



from which 



N 4 (N) = a 4 N 



3-25 



«4 



2 .3-26 

Proceeding the same way we obtain for any k 2 



(A.17) 



dN k+1 3 N k ■ k s 



dN 2 N s 
leading to the asymptotic 

N k+1 (N) = a k+1 N k -^ 5 

with amplitudes 

fc-2 k 



ak+i = a I - 



n 

3=3 



j 



J - (J - 1)8 



(A.18) 



(A.19) 



(A.20) 
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